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Abstract 

We consider a new class of non Markovian processes with a countable number 
of interacting components. At each time unit, each component can take two values, 
indicating if it has a spike or not at this precise moment. The system evolves as follows. 
For each component, the probability of having a spike at the next time unit depends 
on the entire time evolution of the system after the last spike time of the component. 
This class of systems extends in a non trivial way both the interacting particle systems, 
which are Markovian, and the stochastic chains with memory of variable length which 
have finite state space. These features make it suitable to describe the time evolution 
of biological neural systems. We construct a stationary version of the process by 
using a probabilistic tool which is a Kalikow-type decomposition either in random 
environment or in space-time. This construction implies uniqueness of the stationary 
process. Finally we consider the case where the interactions between components are 
given by a critical directed Erdos-Renyi-type random graph with a large but finite 
number of components. In this framework we obtain an explicit upper-bound for the 
correlation between successive inter-spike intervals which is compatible with previous 
empirical findings. 
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1 Introduction 

A biological neural system has the following characteristics. It is a system with a huge 
(about 10^^) number of interacting components, the neurons. This system evolves in time, 
and its time evolution is not described by a Markov process (Cessac 2011). In particular, 
the times between successive spikes of a single neuron are not exponentially distributed 
(see, for instance, Brillinger 1988). 

This is the motivation for the introduction of the class of models that we consider in the 
present paper. To cope with the problem of the large number of components it seems nat- 
ural to consider infinite systems with a countable number of components. In this new class 
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of stochastic systems, each component depends on a variable length portion of the history. 
Namely, the spiking probability of a given neuron depends on the accumulated activity 
of the system after its last spike time. This implies that the system is not Markovian. 
The time evolution of each single neuron looks like a stochastic chain with memory of 
variable length, even if the influence from the past is actually of infinite order. This class 
of systems represents a non trivial extension of the class of interacting particle systems 
introduced in 1970 by Spitzer. It is also a non trivial extension of the class of stochastic 
chains with memory of variable length introduced in 1983 by Rissanen. 

The particular type of dependence from the past considered here is motivated both by 
empirical as well as theoretical considerations. 

From a theoretical point of view, Cessac (2011) suggested the same kind of dependence 
from the past. In the framework of leaky integrate and fire models, he considers a system 
with a finite number of membrane potential processes. The image of this process in 
which only the spike times are recorded is a stochastic chain of infinite order where each 
neuron has to look back into the past until its last spike time. Cessac's process is a finite 
dimensional version of the model considered here. 

Finite systems of point processes in discrete or continuous time aiming to describe bi- 
ological neural systems have a long history whose starting points are probably Hawkes 
(1971) from a probabilistic point of view and Brillinger (1988) from a statistical point of 
view, see also the interesting paper by Krumin et al. (2010) for a review of the statis- 
tical aspects. For non-linear Hawkes processes, but in the frame of a finite number of 
components, Bremaud and Massoulie (1994) address the problem of existence, uniqueness 
and stability. M0ller and coauthors propose a perfect simulation algorithm in the linear 
case, see M0ller and Rasmussen (2005). In spite of the great interest in Hawkes processes 
during the last years, especially in association with modeling problems in finance and 
biology, all the studies are reduced to the case of systems with a finite number of compo- 
nents. Here we propose a new approach which enables us to deal also with infinite systems 
with a countable number of components, without any assumption of the type linearity or 
attractiveness. 

This paper is organized as follows. In Section [2] we state two Theorems proving the 
existence and uniqueness of infinite systems of interacting chains with memory of variable 
length, under suitable conditions. Our main technical tool is a Kalikow-type decomposition 
of the infinite order transition probabilities which is a non trivial extension of previous 
results of the authors in the case of Markovian systems, cf. Galves et al. (2013). The 
decomposition considered here has two major differences with respect to what has been 
done before. Firstly this is due to the non-Markovian nature of the system. Secondly, 
and most importantly, the structure of the transition laws leads to the need of either 
considering a decomposition depending on a random environment or considering a space- 
time decomposition. Using the Kalikow-type decomposition we prove the existence, the 
uniqueness as well as a property of loss of memory of the stationary process. 

In Section [3] we study the correlation between successive inter-spike intervals (ISI) . This 
aims at explaining empirical results presented in the neuroscientific literature. Gerstner 
and Kistler (2002), quoting Goldberg et al. (1964), observe that in many experimental 
setups the empirical correlation between successive inter-spike intervals is very small "in- 
dicating that a description of spiking as a stationary renewal process is a good approxima- 
tion" . However, Nawrot et al. (2007) find statistical evidence that neighboring inter-spike 
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intervals are correlated, having negative correlation. We show that we can account for 
these apparently contradictory facts within our model. This requires the introduction of 
a new setup in which the synaptic weights define a critical directed Erdos-Renyi random 
graph with a large but finite number of components. We obtain in Theorem [3] an explicit 
upper bound for the correlations involving the number of components of the system, as 
well as the typical length of one inter-spike interval. For a system having a large number 
of components, our result is compatible with the discussion in Gerstner and Kistler (2002). 
Gerstner and Kistler (2002) deduce from this that spiking can be described by a renewal 
process. However, for systems with a small number of components, the correlation might 
as well be quite big, as reported by Nawrot et al. (2007) who show that neighboring 
inter-spike intervals are negatively correlated. Therefore, both features are captured by 
our model, depending on the scale we are working in. 

The proofs of all the results are presented in Sections [H [5] and [H 



2 Systems of interacting chains with memory of variable 
length: Existence, uniqueness and loss of memory 

We consider a stochastic chain {Xt)t^i taking values in {0, l}''^ for some countable set of 
neurons /, defined on a suitable probability space {Vt,A,P). For each neuron i at each 
time t € Z, Xt{i) = 1 if neuron i has a spike at that time t, and Xt{i) = otherwise. The 
global configuration of neurons at time t is denoted Xt = {Xt{i),i G /). We define the 
filtration 

= a{Xs,s eZ,s <t),t eZ. 
For each neuron i G / and each time t G Z let 

Li = sup{s < t : X,{i) = 1} (2.1) 

be the last spike time of neuron i strictly before time t. We introduce a family of "synaptic" 
weights Wj^i G M, for j ^ i, Wj^j = for all j. Wj^i is the "synaptic weight of neuron 
j on neuron i". We suppose that the synaptic weights have the following property of 
uniform summability. 

sup^ |Wj-^i| < oo. (2.2) 

Now we are ready to introduce the dynamics of our process. At each time t, conditionally 
on the whole past, sites update independently. This means that for any finite subset J C I, 
Qi G {0, 1}, i G J, we have 

P{Xt{i) = a,,i& J\Tt-i) = J]P(Xt(i) = ai\Tt-i). (2.3) 
Moreover, the probability of having a spike in neuron i at time t is given by 

P{Xtii) = l\Ft-i) = c\>i [y^W^^, "^gjit - s)XS),t- Ljj , (2.4) 

V j s=Ll I 
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where 0j : M x N — )• [0, 1] and 5^ : N — )• M+ are measurable functions for all i £ I,j €z I. 
We assume that (pi is uniformly Lipschitz continuous, i.e. there exists a positive constant 
7 such that for all s, s' S M, n S N, i S /, 



\(j)i{s,n) - (j)i{s',n)\ < 7|s - s' 



(2.5) 



Observe that in the case where the function <j)i is increasing with respect to the first 
coordinate, the contribution of components j is either excitatory or inhibitory, depending 
on the sign of Wj^i. This is reminiscent of the situation in biological neural nets in which 
neurons can either stimulate or inhibit the expression of other neurons. 

It is natural to ask if there exists at least one stationary chain which is consistent with 
the above dynamics, and if so, if this process is unique. In what follows we shall construct 
a probability measure P on the configuration space Q = {0, 1}^^^ of all space-time con- 
figurations of spike trains, equipped with its natural sigma algebra A. On this probability 
space, we consider the canonical chain {Xt)tez where for each neuron i and each time t, 
Xt{i){uj) = uJt{i) is the projection of w onto the coordinate of u. 

For each neuron i, we introduce 



the set of all neurons that have a direct influence on neuron i. Notice that in our model, 
V.^i can be both finite or infinite. We fix a growing sequence {Vi{k))ky_i of subsets of / 



such that Vi{-1) = 0, ^^(0) = {i}, Vi{k) C Vi{k + 1), Vi{k) / Vi{k + 1) ifVi{k) / V^iU{i} 
and U^Viik) = V.^iU{i}. 



We consider two types of systems. The first system incorporates spontaneous spike times, 
see Condition (j2.6p below. These spontaneous spikes can be interpreted as external stim- 
ulus or, alternatively, as autonomous activity of the brain. The existence and uniqueness 
of this class is granted in our first theorem. 

Theorem 1 [Existence and uniqueness in systems with spontaneous spikes] 

Grant conditions 112. S^) and 112.5]) . Assume that the functions (pi and gj satisfy moreover 

the following assumptions: 

1. There exists (5 > such that for all i G I , s €z M,n £ 'N, 



V. 



— 5>t — 



{j GlJ^i-.Wj^i^O} 




(2.6) 



2. We have that 



00 




(2.7) 



n=2 



where G{n) = supj Yl^=i 9i{^) ^'^'^ where 6 is as in condition 1. 



3. We have fast decay of the synaptic weights, i.e. 





4 



Then under these Conditions (K^-^K^, there exists a critical parameter 5* g]0, 1[ such 
that for any 5 > (5*, there exists a unique probability measure P on {0, 1}^^^, under which 
the canonical chain satisfies \2. 3\) and [2^. 



Remark 1 The stochastic chain {Xt)tez introduced in Theorem{l\is a chain having mem- 
ory of infinite order (cf. Doeblin and Fortet 1937, Harris 1955, Berbee 1987, Bressaud, 
Fernandez and Galves 1999, Johannson and Oberg 2003 and Fernandez and Maillard 
2004)- The setup we consider here extends what has been done in the above cited papers. 
First of all, the chain we consider takes values in the infinite state space {0, 1}''. Moreover, 
in Theorem m no summability assumption is imposed on the functions gj. In particular, 
the choice gj{t) = 1 is possible. This implies that the specification of the chain is not 
continuous. More precisely, introducing 



\ J s=Ll(x) 
where L\{x) = sup{s < t : Xs{i) = 1}, we have that 

sup -P(i,t)(l|y)| 7^ as A: 

x,y ■.x=y on Vi{k)x[t—k,t—l\ 



OO 



in the case gj{t) = 1 for all j, which can be seen by taking configurations x and y such 
that L\[x) < t — k and L\{y) < t — k. A similar type of discontinuity has been considered 
in Gallo (2011) for stochastic chains with memory of variable length taking values in a 
finite alphabet. 

As an illustration of Theorem [1] we give the following example of a system with interactions 
of infinite range. 

Example 1 We give an example of a system satisfying the assumptions of Theorem [7J 
Take I = U^, 9j{s) = 1 for all j,s, and Wi-^j = — — for some fixed a > 1, where 

II • 111 is the L^ — norm ofU^. In this case, if we choose Vi{k) = {j G Z*^ = ||j — i||i < fc}, 
we have \Vi{k)\ = (fc + 1)^, and condition \2. 8\) is satisfied, since 



El^^(^)l E m^^\]=Y.('' + ^^'fl^'''d{j■.\\j-^\\^ = l}J^_ 

k>l \jfVi{k-l) I k>l l=k 



I2d+a 

d 



.d^ I C(d) ^ (k + iy 

fc>i ^ ^ 



fc>l l=k 



as a > 1. 



The next theorem deals with the second type of system. Now we don't assume a minimal 
spiking rate. But additionally to the fast decay of the synaptic weights we also assume a 
sufficiently fast decay of the aging factor gj, see Condition (12. 9p below. This additional 
assumption implies that the specification of the chain is continuous. This is the main 
difference with the setup of Theorem [TJ 
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Theorem 2 [Existence and uniqueness in systems with uniformly summahle memory] 
Suppose that (j)i{s,n) = (j)i{s) does not depend on n. Assume conditions \2. and 112. 5\) 
and suppose moreover that 

(oo oo \ ^ 

^ \Wj^i\^g,{n)+ I^^'-^I E 9j{n)] < -, 

j^Vi{k-l) n=l j^Vi(k-l) n=fcVl / ^ 

(2.9) 

where 7 is given in i2.5\) . 

Then there exists a unique probability measure P on {0, 1}^^^ such that under P, the 
canonical chain satisfies ^2. 3\) and (2^. 

Now, for any s < t G Z, let Xl{i) = {Xs{i), Xs+i{i), . . . , Xt{i)) the trajectory of X{i) 
between times s and t. As a byproduct of the proof of Theorems [U and [2] we obtain the 
following loss of memory property. 

Corollary 1 1. Under the assumptions of either Theorem\^or Theorem\^ there exists 
a non increasing function 99 : N — IR+, such that for any < s < i G N the following 
holds. For all i £ I, for all bounded measurable functions f : {0, lit**'*! — )• 

\E[f{XimTo]-E[f{Xlm \ < {t-s + l)\\f\\^^{s). (2.10) 

Moreover, (p{n) < C-^^ for some fixed constant C. 

2. Under the assumptions of Theorem\^ suppose moreover that there exists a constant 
C > such that 

gj{n) < Ce-^"" and sup l^i^jl ^ C'e"^", (2.11) 

for all J S /, n S N, for some j3 > 0. 

Then there exists a critical parameter /?* such that if (i > (3^, 12.10\) holds with 

if{s) = Cq^ for some g g]0, 1[ depending only on (3. (2-12) 

The proof of Theorem [1] is given in Section [J] below. It is based on a conditional Kalikow- 
type decomposition of the transition probabilities (j)i, where we decompose with respect 
to all possible interaction neighborhoods of site i. A main ingredient is the construction 
of an associated branching process in random environment. The setup of Theorem [2] is 
conceptually less difficult, since in this case the transition probabilities are continuous. 
This follows from the summability of the aging factors gj . The proof of Theorem [2] relies 
on a space-time Kalikow-type decomposition presented in Section [5l 

Remark 2 The proofs of both Theorem [7] and Theorem \M imply the existence of a perfect 
simulation algorithm of the stochastic chain {Xt)t£Z- By a perfect simulation algorithm 
we mean a simulation which samples in a finite space-time window precisely from the 
stationary law P. In the setup of Theorem\^the simulation can be implemented analogously 
to what is presented in Galves et al. (2013). The setup of TheoremUl requires a conditional 
approach, conditionally on the realization of the spontaneous spike times. 
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3 Correlations between inter-spike intervals in the critical 
directed Erdos-Renyi random graph 

Wc consider a finite system consisting of a large number of neurons with random 
synaptic weights Wi^j,i 7^ j. The sequence Wi^j,i / j, is a sequence of i.i.d. BernouUi 
random variables defined on some probability space {ft, A, P) with parameter p = pN, i.e. 

P{Wi^j = 1) = 1 - P{Wi^j = 0) = pN, 

where 

PN = A/AT and A = 1 + i}/N for some < 1? < 00. (3.13) 

If we represent this as a directed graph where the directed link i — )■ j is present if and only 
if Wi^j = 1, we obtain what is called a "critical directed Erdos-Renyi random graph". For 
a general reference on random graphs we refer the reader to the classical book by Bollobas 
(2001). 

We put Wj^j = for all j. Notice that the synaptic weights Wi-^j and Wj^i are distinct 
and independent random variables. Conditionally on the choice of the connectivities W = 
{Wi^j, i / j), the dynamics of the chain are then given by 

t-i 

p'^ixtd) = i| j-t_i) = ^i^* E 9j{t - s)Xs{j)). 

j s=L\ 

Here we suppose that is a function only of the accumulated and weighted number of 
spikes coming from interacting neurons, but does not depend directly on the time elapsed 
since the last spike. 

P^ denotes the conditional law of the process, conditioned on the choice of W. We write 
P for the annealed law where we also average with respect to the random weights, i.e. 
P = ^ [P^(-)] , where E denotes the expectation with respect to P. 

Fix a neuron i and consider its associated sequence of successive spike times 

...<Sl^<...<Si<Q<S\<Si<...<Sl<..., (3.14) 

where 

S{ = M{t > 1 : Xt{i) = l},...,Si = M{t > Si_^ : Xt{i) = l},n> 2, 

and 

5^ = sup{i < : Xt{i) = 1}, . . . , 5i„ = sup{i < : Xt{i) = 1}, n > 1. 

Let us fix W. We are interested in the covariance between successive inter-spike intervals 

Lov - E [{!^k+i-^k)\^k-^k-i)\--^ Wk+i-^k)^ Wk-^k-i)^ 

for any k ^ 0,1. Since the process is stationary, the above covariance does not depend on 
the particular choice of k. The next theorem shows that neighboring inter-spike intervals 
are asymptotically uncorrelated as the number of neurons N tends to infinity. 
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Theorem 3 Assume that 112. 5]) . \2.b]) and {2.1\j are satisfied. Then there exists a mea- 
surable subset A & A, such that on A, 

\Cov'^{Sl - SI, Si - Sl)\ < ^N{1 - 6)^, 

where 5 is the lower bound appearing in Condition \2. 6]) . Moreover, 

P(A^) < e^^N-^/^. 

For large A^, if the graph of synaptic weights belongs to the "good" set A, the above result 
is compatible with the discussion in Gerstner and Kistler (2002). Gerstner and Kistler 
(2002) deduce from this that spiking can be described by a renewal process. However, 
for small N or on A^, the correlation might as well be quite big, as reported by Nawrot 
et al. (2007) who show that neighboring inter-spike intervals are negatively correlated. 
Therefore, both features are captured by our model, depending on the scale we are working 
in. 

The proof of the above theorem is given in Section [6] below. 



4 Conditional Kalikow-type decomposition and proof of The- 
orem [1] 

In order to prove existence and uniqueness of a chain consistent with (j2.3p and ()2.4p . we 
introduce a Kalikow-type decomposition of the infinite order transition probabilities. This 
type of decomposition was considered in the papers by Ferrari, Maass, Martinez and Ney 
(2000), Comets, Fernandez and Ferrari (2002) and Galves et al. (2013). All these papers 
deal with the case in which the transition probabilities are continuous. This is not the 
case here. We are dealing with a more general case in which the transition probabilities 
might as well be discontinuous, see the discussion in Remark [TJ This makes our approach 
new and interesting by itself. The new ingredient is a construction of a decomposition 
depending on a random environment. This random environment is given by the realization 
of the spontaneous spikes. 

More precisely. Condition (j2.6p allows to introduce a sequence of i.i.d. Bernoulli random 
variables {^tii),i E /,t S Z) of parameter 6, such that positions and times {i,t) with 
£,t{i) = 1 are spike times for any realization of the chain. We call these times "spontaneous 
spike times". We work conditionally on the choice of ^t(i),t G Z, i G /. In particular we 
will restrict everything to the state space 

= {x£ {0,1}^''^ : xt{i) > Ui),yi eI,teZ} 

which is the space of all configurations compatible with ^, i.e. all neural systems x such 
that every spike time of ^ is also a spike time of x. We write 

Ri = sup{s < t : ^,(i) = 1} (4.15) 

for the last spontaneous spike time of neuron i before time t. Moreover, for x G S^, we 
put LI = L\{x) = sup{s < t : Xs{i) = 1}. 
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Consider a couple {i,t) with ^^(i) = 0. In order to prove Theorem [T] we need to introduce 
the following quantities which depend on the realization of ^, namely 



t-1 



(4.16) 



--Llix) 



which is the minimal probability that neuron i spikes at time i, uniformly with respect to 
all configurations, and 



4)(0)= 1^1, 



x&SS 



t-1 



E^i^^ E 93it-s)^s{j),t-L\{x) 

' s=Ll{x) 



(4.17) 



which is the minimal probability that neuron i does not spike at time t. 

Notice that for all x G SS, L\{x) > R\. Hence in the above formulas, only a finite time 
window of the configuration x is used, and uniformly in x G SS, this time window is 
contained in {xs{j),Rt < s < t — 1, j G /). In particular the above quantities are well- 
defined. 

Now fix X G SS. For any k >0, we write for short x''7l^ {Vi{k)) for the space-time configu- 
ration 

J-i 



x\-'{Vi{k)) = (xsU) ■.Ll<s<t-l,je Vi{k)) 



and put 



r^!;\jl\x'-\V,ik))) = inf 



t-1 



E E 9j {t -s)zs{j),t- L\ {x) , 

s=L\{x) I 

(4.18) 



inf 

z^ss:z(y,{k))=x(y,{k)) 



t-1 



E^^i^^ E 9A^-s)z,{2).t-l}^{x) 

' ^=L\{x) 



.(4.19) 



In what follows and whenever there is no danger of ambiguity, we will write for short 
x(y,(k)) and rJJl^)(a|x(y,(A;))) or r[Jl^(a|x) instead oi x'^^ (y,(}i)) and rJJi^(a|2;*-i(y,(A:))). 



{i,t)v 



We put 



a 



(i,t) 



(-1) = A(,,)(-1) = E 



a=0 



a(,,,)(/c) = jnf^ (E'^J'V^I^^^^^^))) ) ' ^ ^ 0' 



\a=0 



and 



\i,t) (k) = a{i,t) (k) - a(i,t) {k-l),k>0. 



(4.20) 
(4.21) 
(4.22) 



Note that A(j j)(A;) G [0, 1] and that Ylk>-i A{j,t)(fc) = 1 almost surely with respect to the 
realization of {S,ti'i),i ^ I ,t ^ "L). 
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Remark 3 The A(j A; > 0, are random variables that depend only on the realization 
of {(,t{i),i G I,t £ Z). More precisely, for any i,t and k, A(j^t)(A;) is measurable with 
respect to the sigma-algebra a{S^s{j), Rl ^ s < t,j € /). We write X^^.^^{k),k > 0, in order 
to emphasize the dependance on the external field ^. 

We introduce the short hand notation 

p^i,t){l\x) = cpJY,Wj^^ Yl 9j{t-s)xs{j),t-Li{x)\ (4.23) 

\ J s=Li{x) J 

and 

P(i,t)(0|x) = 1 

The proof of Theorem [T] is based on the following proposition. 

Proposition 1 Given ^, for all {i,t) with ^t(i) = 0, there exists a family of conditional 
probabilities (p|^'^^(a|3;))fc>o satisfying the following properties. 

1. For all a, k > 0, SS B x p'^^j^(a|x) depends only on the variables (xs(j) '■ L\ < 

s<t-i,j eViik)). 

2. For all xeSS,k>0, pg)^(l|x) € [0, 1], pgf (0|x) + pgl;J(l|x) = 1. 

3. For all a,x,k> 0, P^^il^i^lx) is a cT(^s(j),i?j < s <t — l,j G I) — measurable random 
variable. 

4. For all X £ SS, we have the following convex decomposition. 

p^,,t){a\x) = Xi,,t){-l)p\ll]'Ha) + E A(.,t)(A;)pgf (a|x(y.(A;))), (4.24) 

A:>0 

where 



Pi.t («)- 



^ ^"A(,,)(-l)- 

From now on, we shall omit the subscript ^ whenever there is no danger of ambiguity and 
write P^(l\^ instead of p^^Jly 

Remark 4 The decomposition ((^.^^[ ) of the transition probability P{j,t)(-|x) can be in- 
terpreted as follows. In a first step, we choose a random spatial interaction range k G 
{ — 1,0,1,...} according to the probability distribution {A(j^t) (A;), /c > —1}. Once the range 

Ik] 

of the spatial interaction is fixed, we then perform a transition according to P(j which 



duction to this technique can be found in the lecture notes of Fernandez et al. (2001). 



depends only on the finite space-time configuration x^^ (14 (A;)). A comprehensive intro- 
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Example 2 Suppose that all interactions are excitatory, i.e. Wj^i > for all i ^ j. 
Suppose further that gi{s) = 1 for all s £ N,i £ I, and that <I>j(s,n) = ^i{s) does not 
depend on n and is non- decreasing in s. Then 

(1) = ^^ (^W,^,Ct-i{j)^ and r{-;l(0) = 1 - (^W,^^{t - Rl) 

Hence, 

A(.,t)(-1) = 1 + Wj^iCt-i{j)j - Wj^i{t - Ri) 

Similarly, 

(t-i t-i 

j^ik) m=L\{x) j&Vi{k) s=Li{x) 

and 

r{Jl)(0|rE) = l-$, j ^ W,^,{t-Li{x))+ J] TV,^, |] | . 

Proof of Proposition [T] We have for any > 1, a G {0, 1} and x G 55, 

P{i,t){a\x) = ry^^]{a) + [j^ /\f}^^^{a\x{Vi{k))) + {pi^i^t){a\x) - r^^}){a\x) 

\k=0 ' / 

where 

a[J1 )(a|x(y.(A:))) = rj^ )(a|x(F.(fc))) - rfr^ {a\x{V,{k - 1))). 
Now, due to condition ()2.5|) . 

|P(i,i)(ak) - '■(^4)(ak)l < 7 E '^^P**^* ~ ^) I E l^j^i|-^0 

as N ^ CO due to (12. 2p . In the above upper bound we used that 

t-i t-i 

E ~ •s)N^(i) - Xs{j)\ < E '5^P5'j(* - s) < oo 

almost surely, which is a consequence of ()2.7p . Therefore we obtain the following decom- 
position. 

oo 

Pii,t){a\x) = ry (a) + E (a|x(F,(A;))), a G {0, 1}, for all x G 55. (4.25) 

k=0 
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Now let 

[-1]. X ^y^") 

Moreover, for A; > 0, put 

\,^t){k,x{Vm) = Y.AVf^^^{a\x{V,{k))), (4.26) 

a 

and for any {i,t),k such that A(j^()(A;, x(Vi(A;))) > 0, we define 

Ar\Mx{Vi(k))) 
A(,,)(fc,x(y,(fe))) 

For {i,t),k such that A(j ()(A;, x(Vi(A;))) = 0, define p'p^^{a\x{Vi{k))) in an arbitrary fixed 
way. Hence 

oo 

Pi.,t){a\x) = X^,,t)i-l)p^^i]{a) + Y.\^,t){k,x{Viik)))^^^^^^ (4.27) 

fc=0 

In (^^727\) the factors \i^t){k,x{ViXk))),k > 0, stih depend on x^~^{Vi{k)). To obtain the 
desired decomposition, we must rewrite it as follows. 

For any {i,t), take the sequences a(^i^t){k) , X(^i^fj{k) , k > —1, as defined in ()4.2ip and (j4.22p . 
respectively. Define the new quantities 

a^,^t){k,x{Vi{k))) = Y,\i,t){^,xm))) 

l<k 

and notice that 

a^,^tj{k,x{V,{k)))=Y,r^,t)ia,xmk))) 

a 

is the total mass associated to r|^^^^(-, x(T^(A:))). From now on and for the rest of the proof 
we shall write for short a(j j)(/c,x) instead of writing a(^i^t){k, x{Vi{k))) . 

Reading (14.27P again, this means that for any k > 0, we have to use the transition proba- 
bility on the interval ]a(i,t)(A: - 1, x), Q(i^j) (A;, x)]. 

By definition of a(i^f^{k) in (I4.2ip . a{i^t){k) is the smallest total mass associated to 
uniformly with respect to all possible neighborhoods x{yi{k)). Hence, in order to get the 
decomposition ()4.24p with weights A(j f)(A;) not depending on the configuration, we have 
to define a partition of the interval [0, a(j_t) (A;, x)] according to the values of a(j^t)(^) 
we have to define probabilities P^^}^-^ working on the intervals ]a(j f)(A; — 1), a(j_t) (A;)]. This 
can be done as follows. 

Fix /c > and suppose that for some I' < I < k — 1, 

"(i.t) {I' -l,x) < ai^i^t) (k-l) < a(i_4) (/', x) < . . . < 

< ai^i,t){l,x) < a(^i,t){k) < a(i,t)(/ + l,x). 
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and therefore 



( U - ) U ]a(i,t)(^,a;),a(i^t)(/i;)]. 

\m=l'+l I 

Hence the probabihty that has to be defined on the interval ]a(j t)(A; — 1), a(j (/c)] 
has to be decomposed according to the above decomposition into sub- intervals. On the 
first interval ]a(j_t)(A; — 1), a(j x)], we have to use the original probability v\iiy on 

each of the intervals ]a(i,f)(m — 1, x), Q;(j^i) (m, x)], we have to use p'™]), and finally on 

]a(i,t)(^2;),a(i^t)(/c)], we use pj^^J'. 

This yields, for any A; > 0, the following definition of the conditional finite range probability 
densities. 

k-l 

~ ''^{a(i,t)(''-l,a:)<«(i,t)(fc-l)<a(i,t)('',a:)}^{a(i,t)(',2:)<a(i,t)(fe)<Q(i,t)('+l,a;)} 
-l=l'<l 

P(,,)(a|x(F.(/ ))) 



\i^t){ m,x{V,{m)) 



+ L X;-(^^ P(,,)(a|x(y.(m))) 

m=l'+l ^ 



+ 



\i,t)(k) ^'-'^ 



. (4.28) 



\k] 

Note that by construction, the above defined probability P(j depends only on the con- 
figuration x{Vi{l + 1)), hence at most on x{Vi{k)), since I < k — 1. Multiplying the above 
formula with A(j_t)(A;) and summing over all k shows that the decomposition (j4.27p implies 
the desired decomposition (j4.24p . This finishes our proof. • 



Thanks to condition (j2.5p . the following estimate holds. It will be useful in the sequel. 
Lemma 1 We have for all k > 1, 

t-i 

4,*)^^) ^ ^ E \WJ-,^\Y.9jit-s){l-Uj)) 
imik--^-) s=R\ 

\s=Rl ^ I im{k-l) 

We use this estimate in the 

Proof of Theorem [1] We work conditionally on the realization of the process G 
Z, i G /. Take a couple {i,t),i £ I,t £ "Z, such that £,t{i) = 0. We have to show that it 
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is possible to decide in a uniquely determined manner if t will be a spike time or not. 
In order to achieve this decision, we have to calculate where x is the unknown 

history of the process. 

We will construct a sequence of sets {Cn'^^)n, Cn'^'^ C /x] — oo,t — 1], which contain the 
sets of sites and anterior spike times that have an influence on the appearance of a spike 
at time t for neuron i. The choice of these sets is based on the decomposition (j4.24p . 

First, for any couple (j, s) with (,sij) = 0, we choose, independently from anything else, an 
interaction neighborhood V(^j^s^ = Vj{k),k > —1, with probability A^^. ^^(fc). Here, we put 
Vj{—1) = 0. A choice k = —1 for a couple (j, s) implies that we can immediately decide 
to accept a spike at time s for neuron j with probability 



(1) 



and to reject it with probability 



(0) 



We suppose that the choice of all V^j ,,) is fixed. Then by (j4.24p . the decision concerning 
the couple {i, t) depends on the configuration of the past x^~^(V(j^()). Since we do not know 

LI, we use the a priori estimate on LI which is given by Rl- Thus we consider the worst 
case in which we have to evaluate the past up to time R]. 

In order to decide about the length t — LI, we have to assign values or 1 to all couples 
{i,s),Rl < s < t. All these couples are influenced by their associated interaction region 



• So we consider 



s-l 



Cf''^= U U [J{{j,u),^,{u) = 0}. 



(4.30) 



=Ri + lj&V(i,s)\{i} u=Rl 



(i t) 

It is clear that if we know the values of all couples belonging to (7| then we are able 
to assign a value to any {i,s),R\ < s < t. Therefore, we call cj*'*^ the set of ancestors of 

generation 1 of {i,t). Continuing this procedure, any couple {j,s) € cj*'*^ itself has to be 

(i s) 

replaced by the set of its ancestors . We put 

/ 



\C 



(4.31) 



where Cp'*^ is defined as in ()4.30p . and then recursively. 



U CI 



{i,t) 

n-l 



(4.32) 



■,{i,t) 



We will show below that almost surely there exists a first time n < oo such that C„ 
Then necessarily C^*l*| consists only of couples (j, s) which chose an interaction neighbor- 
hood of range —1 or which interact only with couples representing spontaneous spiking. 
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Thus we can decide to accept or to reject a spike for any of them independently of any- 

(i t) 

thing else. Once the values associated to all elements of C^^{ are known, we can realize 

(i t) 

all decisions needed in order to attribute values to the elements of C^lj- this way, we 

(i t) 

will be able to assign values in a recursive way to all ancestor sets up to the first set, C| , 
which allows us finally to assign a value for neuron i at time t. We call this procedure the 
forward coloring procedure and we call Yt{i) the value of neuron t at time t obtained at 
the end of this procedure. 

1. We first show that the probability constructed above is the law of Xt{i) under the 
unique invariant measure P. Put C^'*^ = |Jn>i C*™ and let 

T^^%p = inf{s : C7^'*) Cl x [t - s,t - 1]}. (4.33) 

Fix some initial space-time configuration t] G {0, such that r]Q{i) = 1 for all i G I. 
Let be the chain that evolves according to ()2.3p and ()2.4p . conditionally on = rj. 

Recall that lt(i) is the value obtained at the end of the above described forward coloring 
procedure. Then for any / : {0, 1} — M+, 

EifiXni)) = E{f{X^,{i)),Tlflp<t) + E{f{X^,{i))MsfoP>t) 
= E{f{Yt{i)),Tlflp<t) 

+E{f{r>{i)),Tlflp>t). (4.34) 

But 

E{f{X'S{i)),Tlflp >t)< \\f\\ooP{T^i'ljp > t) ^ as t ^ CO. 
Hence we obtain that 

lim E{f{X?m = E{f{Ytm, 

i— >oo 

since lj„(i,t) .i ~^ ^ almost surely. 

This implies that P is the unique invariant measure of the process. 

2. We now show that the above procedure stops after a finite time almost surely. For that 
sake we put 

N^^^Ip = mm{n : C7^^'*) = 0}. (4.35) 

(i t) (i t) 

Our goal is to show that X^^Qp < oo almost surely. Notice that this implies that T^^Qp < 
oo as well. In order to do so, let 

\CtKn>l, 

be the cardinal of the set of ancestors after n steps of the above procedure. Then 

P{Nl^lp>n)<E{\Ct%. (4.36) 

As a consequence, it is sufficient to show that E{\Cn'^\) — )• as n — )■ oo. For that sake we 
compare (|Cn'*^|)„ to a branching process in random environment, where the environment 
is given by the i.i.d. field ^. 

Recall the definition of G{n) = X]m=i ^^Pi ^^^"^ upper bound (|4.29p . Thus for 

any k>l, 

\s=R\ ^ ) jm{k-i) m^{k-i) 
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Let 

\^,t)(.k)=^G{t-Ei) J2 l^i^^l- (4.37) 

j^V,{k-l) 

Notice that ^(i^t)ik) depends only on the reahzation of the i.i.d. field ^ through the value 
oft-Ri 

Since A(j^t)(A;) < A(j^t)(^) for all /c > 1, it is possible, by standard branching process 
coupling arguments, to construct a sequence (C'n'*^)n coupled with the original sequence 
(Cn'*^)n satisfying the following properties. 

1. Ci''^^ is defined through (O0]l and (|i3T]) by using the family (A(jt)(^)) instead of 
(A(M)W)- 

2. For ah n, \ct^''\ < 

Hence it is sufficient to show that E{\Cn'^^\) tends to as n — )• oo. In order to do so, we will 
control the reproduction mean depending on the environment ^. Here, reproduction mean 
stands for the mean number of sites belonging to c['''^\ for any fixed couple (j, s), where 
the expectation is taken with respect to the choices of the interaction neighborhoods V(i,s), 

~ (it) 

conditionally on the realization of ^. More precisely, given Cn = c, the reproduction mean 
of any couple (j, s) belonging to is given by 

s / s-l 

where denotes expectation conditionally on ^. 

In what follows we upper bound the above expression. We first use that, by definition 
(j4.37p . for s < s, Aq- < X(^j^g){k). Therefore, 



E^{\cl^''^\\{j,s)eCt^ = c) 



< (c,-i?i)^A(,,,)(A;) ( E [l-e«(O]%«)0c} 

Recalling the explicit form of Aq- s)(/c) in (I4.37P and using moreover the upper bound 

s-l 

E [l-^«(0]l{(/,«)^c} <s-R'j, 

u=Ri 
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on the event Rj < s — 1, we obtain 



(\c[^''^\\{j,s)eCt^ = c) 



<27 1 



{Ri<s-1} 



k>l 



yl^Vj{k-l) 



k>l \l,^Vjik-l) J leV.j(k),l^j 

=--rnu,sy (4.38) 

Observe that m^. depends on 4, but to avoid too cumbersome notation, we omit the 
superscript ^. 

Taking conditional expectation, conditionally with respect to ^, we get 



m 



(4.39) 



Therefore, taking expectation with respect to 



Now note that the event 



depends only on R^. Hence, recalling (I4.38p . the above event is independent of "^-(^"5)^ 
which depends only on s — i?s- As a consequence. 



(4.40) 



E{1,. .^^uu^fr^"^ = P ((j» G C^::!l) E{mf;l). 



But 



Y,5{i-8r-WG{n)Y,mk)\\ E i^'-^-i 

liVj{k-l) 



n=2 



We put 



fc>i 



+5G(i)j; Yl m-,,\]\vj{k)\ii-5) 

k>l \lifVj{k-l) 



C^:=27supJ]|y,(A:)| ^ liy^^.-l 
fc>i V/^VjCfc-i) 



(4.41) 



(4.42) 
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and define 

oo 

E{G, 6) = G{1) + ^(1 - (5)"-2„2^(n). 

n=2 

Then 

E[fhl"jl] < (1 - S)E{G, S) =: e{5). (4.43) 

Note that e{S) is decreasing as a function of 6 and tends to for 5 — )■ 1. In particular, if 
we put 

6* = mi{6 G]0, 1[: e{6) < 1}, (4.44) 
then e{6) < 1 for all 6 > 5^:. Now we may iterate (|4.4U|) and (|4.43p and obtain 

E{\Ct^\) < = E{\cl::!l\)e{6) < e{5r ■ (4.45) 

Since e((5)" — )• as n — )• oo for 5 > (5=,, , this implies our result. • 

Remark 5 Note that the value of 5* given in ^4 -441 ) ca?^ be explicitly calculated depending 
on the specific structure of the aging functions gj{s). For instance, if gj{s) = 1 for all j,s, 
then G{n) = n for all n, and the value of 5* follows from standard evaluations of the third 
moment of a geometrical distribution. 

Remark 6 The above proof uses a non-trivial extension of the so-called Clan of ancestors 
method employed by Fernandez et al. (2001) and Fernandez et al. (2002). In these papers 
the authors study the Clan of ancestors of a given vertex (or object) in a random system. 
They prove that this set is almost surely finite and deduce a perfect simulation algorithm. 

Proof of Corollary [1] part 1. We prove the first part of Corollary [TJ We keep the 
notation of the proof of Theorem [TJ Put C,^'*'' = ci* . Then the random variable 
Tgj^Qp defined in (j4.33p can be trivially upper bounded by 

TUlp<\Cli^\ (4.46) 

which is the total number of elements appearing in the ancestor process. This is a very 
rough upper bound on the number of steps that we have to look back into the past in 
order to choose a value for Xi{t). 

By construction, the value Xi{t) depends on all choices of interaction regions V{j^s) such 
that (j, s) G C^'*^ and on the values of the i.i.d. Bernoulli field for t — T^^^Qp < u < t. 
Moreover, for any {j, s) G C^'*^ , a random decision has to be made whether to associate 
the value +1 or to this couple. These decisions can be realized based on a sequence 
of i.i.d. uniform random variables {Ut{i),i £ I,t €z Z), which are uniform on [0, 1]. As a 
consequence, Xt{i) is measurable with respect to the sigma-algebra 

C7{V(,, UsU) : (i, s) G Ct'\^u ■■ t - T^flp <n<t]. 

Writing 

R= \ni iu-T^^lp), (4.47) 

u^\s,t\ 
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we deduce that 

Xl{i) is cr{V(j», ■.jeI,R<u< t} -measurable. (4.48) 

Let A G J-Q. We write for conditional expectation, conditionally with respect to ^. 
Then we have 

= E[f{Xlii))lAl{B>o}] + E[fiXlmAl{R<o}] 

= E[E^[f{XimAl{R>o}]] + E[E^[f{Xi{i))lAl{R<o}]]- 

Clearly, /(^s(«))l{_R>o} ^^^^ A are independent, which follows from (j4.48p . Hence 
E^f{Xi{i))lA] = E^[f{Xim{R^o}]PHA) + E^[f{Xi{i))lAl{R<o}]. 

We take expectation with respect to ^ and use the fact that £'^[/(X*(i))l|^>o}] is mea- 
surable with respect to (t{Cu : 1 < u < t} and P^{A) is measurable with respect to 
: n < 0}. Hence by independence 

E[f{Xlii))lA] = E[fiXlii))^n>o}]PiA) + i?[/(X*(i))Ul{K<o}], 
and therefore 

E[f{Xl{i))lA] - E[f{XlmP{A) = E[f{Xl{i))lAl{R<o}] ~ E[f{Xl{i))^j,^o}]PiA). 
As a consequence, 

\E[f{Xl{i))lA]-E[f{XimP{A)\ < \\f\\^P{A)[P{R<0\A) + P{R<0)] 

< \\f\\^PiA)[PiR<l\A) + PiR<0)] 

< 2\\f\\^P{A)P{R<l). 

Here we have used that {i? < 1} is independent of A. Indeed, the event < 1} does only 
depend on choices of Vq-^s) and ^s(j) having time component s > 1. Now we conclude as 
follows. By definition of R, 



P{R<1)< PiTii%>s-l). 



Using (j4.46p . we obtain 



PiT^T^P >s-l)< P{\Ct^\ >s-l)<E \Ct)\) < 



1 - l-e(5)s-l' 



since 



OO -. 

E{\ct^\)<Y.E{\cl:'''^\)<Yzr^y 



n=l 

where we have used (j4.45p . This implies the result. 
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5 A space-time Kalikow-type decomposition and proof of 
Theorem [2] 



The additional condition ()2.9p allows to introduce a space-time Kalikow-type decompo- 
sition without the assumption of existence of spontaneous spikes. We will decompose 
with respect to increasing space-time neighborhoods Vi{k) x [—k — l,—l],k > —1, where 
Vi{-1) = 0. Write S = {0, for the state space of the process and introduce, analo- 
gously to (|4l6]) ~ (l4T9]l . 

-1 

r^r'\l) = infJ>,{Y^W,^, 9j{-s)xs{j)), (5.49) 



-1 



r\ ^l(O) = inf 1 - ci>,(5^ Wj^, Yl 9ji-s)xsij)) , (5.50) 

-1 

rf\l\x)= inf MY,W,-,^ 9j{-s)zs{j)), (5.51) 

rf\l\x)= inf Ii-MY^o^^ E 9ji-s)zs{j))] , (5.52) 

.^s-M^i)=x^i-i) y ^ ^^^^^^ j 

and then for any k>l, 

-1 



^f(lk)= inf ^,{YWJ^^ Y 9j{-s)zs{j)), 

J s=Lj,(2) 



(5.53) 



-1 



rf'(0|x)= inf l-$,(VVF,-^i V 5,(-s)zJj)) 

(5.54) 

Putting 

«,(-!) = A.(-l)=rl-^](l)+rl-^](0), 
ai(A:) = inf (^rfl(l|x)+rfl(0|x)) ,A,(A:) = a,(A;) - Qi(A; - 1), A; > 0, (5.55) 
we obtain the following space-time Kalikow-type decomposition for 

\ i s=Ll{x) j 



Proposition 2 Under the conditions \2.S\) . \2. 5j) and (E^, for any i G I, the above 
defined quantities Xi{k) take values in [0,1] and 

Y ^^(^) = 1- 

A;>-1 

Moreover, there exists a family of conditional probabilities {p^lf^-j{a,\x))k>o satisfying the 
following properties. 
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1. For all a, S B X p'^ j^(a|x) depends only on the variable xt-i{i). 

2. For all a, k > 1, S B X p|^'^^(a|x) depends only on the variables {xs{j) '■ t — k—1 < 

s<t-i,j eViik)). 

3. For allx£S,k> 0, G [0, 1], p[jyO|x) +^[^^1^) = 1. 
4- For all x G S, we have the following convex decomposition 

P{i,t){a\x) = Ai(-l)p^^(a) + '^Xi(k)p^lf^^^{a\x), (5.56) 

fc>0 

where 

Pm(^)- A,(-l)- 

The proof of this proposition fohows the hnes of the proof of Proposition [TJ Moreover, the 
following estimates hold. 

oo 

Ai(0) <7j;|W,->ilE5iH, (5.57) 

j n=l 

and for all /c > 1, 

(oo oo \ 

jm{k-l) n=l iGV,(fe-l) n=k ) 

Proof of Theorem [2] We construct again a sequence of sets {Cn'^'^)n C /x] — oo,t — 1] 
which contain the sets of sites and anterior spike times that have an influence on the 
appearance of a spike at time t for neuron i. The choice of these sets is based on the 
decomposition ()5.56p . 

First, we choose for any couple (j, s), independently from anything else, a space-time 
interaction neighborhood Oq- C I x [— oo, s — 1] 

{Vj{k) X [s — A;— l,s — 1] with probability Xj{k), k > 1, 

{j} X {s — 1} with probability Xj{0) 

with probability Aj(— 1). 

Then we put 

/ \ 



\ (C7f'*)u...UCi'l*J) . (5.59) 



I (i t)\ 

The process \Cn' \ can be compared to a classical multi-type branching process with 
reproduction mean 

m^ = Xi{0) + Y,ik + l)mk)\Xi{k). 

k>l 
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Now, condition (|2.9p together with the estimates (|5.57|) and (|5.58|) shows that m := 
supjmj < 1. As a consequence, 

Pi^STOP >'"')< < m" ^ as n ^ oo, 

and this finishes the proof. • 

Proof of Corollary [1] The proof of the first part of Corollary [1] is analogous to the proof 
under the conditions of Theorem [TJ We give the proof of the second part of the Corollary. 
We keep the notation of the proof of Theorem [2l We define the projection on the time 
coordinate 

T(C)q-,)) :=s-k-l if Oq-,) = Vj{k) x [s - k - 1, s - l],k > 0,T{Oq^s)) ■= s else. (5.60) 
Define recursively 

Tf'*) = r(0(,,)),r(^'*) = min{r(0(,.3)) : (j» G c^}^^ > 1, 
and let finally 

rp{i,t) rp{i,t) 

^STOP ^ Ar{«,t) _i- 
"stop 

Then as in the proof of the first part of the corollary, 

\E[f{Xl{i))lA] - E[f{Xl{i))]P{A)\ < 2\\fUP{A)P{R < 1), 



where 



We have 

p{R<i)< pi4T%<^), 

where 

PiT^T^P < 1) = Pi^ - PsTOP 

In what follows, C will denote a constant that might change from line to line, but that 
does not depend on /3. 

We wish to compare u—Tg^Qp to the total offspring of a classical Galton- Watson branching 
process. In order to do so, notice first that using (I5.57P and (|5.58p . we get for any k >1, 

-/3{fc-l) 
A.(A:) < C-^—^ =: A(fc), 

where we have used (|2.1ip . Moreover, 

A.(0) < Cj^^ =: A(0). 

It is immediate to see that for /3 sufficiently large, X]fc>o A(A;) < 1, since 

C ' 



A:>0 
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as /3 — )• oo. Therefore we can couple u — T^j^Qp with the total offspring 7" of a classical 
Galton- Watson process, where each particle has k + 1 offspring, A; > 0, with probability 
A(/c), and offspring with probability 1 — J2k>o ^(^)- This coupling can be done such that 

u — Tg^Qp < T. Thus it is sufficient to evaluate the law of the total offspring T. For that 
sake let Wn be a random walk starting from at time 0, with step size distribution ry, 
where 

_ J — 1 with probability 1 — J2k>0'^i^) 
^ \ k with probability X{k),k > 0. 

Then T = T_i = inf{n : Wn = — !}• Hence for any A g]0, /3[, since u > s, 
P{u - T^sTOP >u-l)< P(r_i > n - 1) < P(r_i > s - 1) = P(T_i > s-2) 

< p(e^^-2 >i)< ^{\y-^^ 

where 

fe>0 

Now, fix A = 1, then there exists such that for all /? > /S*, V'(l) < 1- Putting g = ip{l) 
and C = Q~'^ yields the desired result. • 



6 Proof of Theorem [3] 

In order to prove Theorem [3l we introduce the sequence of sets 

V^^. = {j : Wi^j = 1}, . . . , Vr^. = {j : 3k E V^'' : W^^j = l},n> 2. 

Note that j £ Vi^. if and only if neuron i has a direct influence on the spiking behavior 
of neuron j. We put 

t' = inf{n : i G V^^.}. 

This is the first time that an information emitted by neuron i can return to neuron i itself. 
Recall that A = 1 + -d/N and define ^ = -^^^A. We have the following lower bound. 

Proposition 3 For any k the following inequality holds. 

Hr' < ^ < V (4) ■ 

Proof Put 

Vr_. = {j^i:3ke VI'-' : Wk-^j = l},n> 2, V}^. = Vh.. 

The sequence of sets Vl'_^.,n > 1, equals the original sequence Vf_^.,n > 1, except that we 
excluded the choice of i itself. On {r* > /c}, clearly Un<fe-iK'^>- ~ Un<fe-iK'^-' ^^"^ 
can write 

p{T'>k) = p{wj^i = {)y U vr_,.|. 

\ n<k-l I 
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Since in the definition of V,^., no choice W.^i has been made, we can condition with 
respect to Un<fe-iH^ ' ^^^^ ^^^^ j ^ Un<fc-i^l^ ' random variable 

Wj-^i is independent of Un<fc-i ''^i^ ' ^^'^ obtain the following equality 



PiT'>k)=E (I-PN 



ilUi< 



, V" I 

-1 ^z-^- I 



We conclude as follows. We can couple the process |Vj^.|,n > 2, with a classical Galton- 
Watson process Zn,n > 2, starting from Zi = V^^,, such that jV-l^.j < for all n > 2. 
The Galton- Watson process has offspring mean /j. = {N -2)^. Here, the factor N -2 
comes from the fact that any j has N — 2 choices of choosing arrows Wj^., since j itself 
and i are excluded. 



Therefore, 



P{t' >k)=E f(l -pAr)IUi<n<fc-i'^f-^ l] > E [(1 -piv)^"=i^" 



Write = Zi + . . . + Z^^i and let E{s^''-^), s < 1, be its moment generating function. 
Using the convexity of the moment generating function, we have that 



£:(s^'=-i)>i + ^(Sfc_i)(s-i). 



Using that E(Zi) = ^j^^ and that the offspring mean equals the claim follows from 

A^- 1 



N 



-A 



.fe-2 



< A + .-. + A'^"^ < (it - 1)A 



ik-l 



1 + tJ,+ ... + 

since < A and A > 1. Hence, evaluating the above lower bound in s = 1 — pn, we obtain 

P{t' > k) > 1 - pNik - l)X''-\ 

and therefore. 



P{t' <k)< pN{k - 1)X^-^ = 



k 



N 



since pN = X/N. Using that X = 1 + -d/N, we obtain the assertion. • 

In what follows, az\ denotes the finite sequence (a_fc, . . . , O-i). In particular, the notation 
aZjlO'^~^ denotes the sequence given by {a^i, . . . , a_i, 1, 0, . . . , 0). We write for short 

pW^)(a|a4) = = a\Xt'{i) = aZD 

for the transition probability of neuron i, given a fixed choice of synaptic weights W. 
However, conditionings will be read from the left to the right. In particular, we write 



Xi(i) = 0,Xo(i) = l,Xl/(i 



The following proposition shows that on the event {r* > k + l}, the two transition proba- 
bilities p(^'^)(l|0^-^lal/) and p(^'^)(l|0^-^l) necessarily coincide. 

Proposition 4 For any k> 1,1 > 1, 

{p(W'i){l\0''-haZ]) / p^^''\l\0''-h)} c{T'<k + I}. 
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Proof Let W be fixed. From now on, since we will work for this fixed choice of W, we will 
omit the superscript W and write for short p*(a|a~^) instead of p(^'*)(a|a~^). We have 

xt\j) = ztHj)yj^v.) 

(fc— 1 
jeVi s=o 



Thus, 



k-l 



The same calculus shows that 

/ fc— 1 

p\i\o'-H) = Y^ Yl '^M E E5^(^ - 

• P'^iXtHj) = zt\j),yj G V.|X*^-i(i) = 10^-1). 

This shows that in order to ensure that p'{l\0^^^1aZl) = p'{l\0''-^l), it is sufficient to 
have 

p'^ix^-Hj) = z',-Hj),yj G ViixtHi) = lo'-') = 

= P^K-^(j) = ztHj)yj e V.\X'_tH^ = aZlw'-'), (6.62) 
for all possible choices of ZQ^^{j),j G Vj, which is implied by r* > /c + / . • 

Proof of Theorem [3] In this proof, without loss of generality and to simplify the pre- 
sentation, we suppose that is the canonical state space of W. We will use the spon- 
taneous spike times {n G Z : ^„(i) = 1} introduced in the proof of Theorem [H in Sec- 
tion m above. We recall that these are independent Bernoulli random variables with 
P{^n{i) = 1) = S for ah i G {1, . . . , iV}, for all n G Z. Write I = sup{n < S"! : = 1} 
and r = inf{n > : Cnii) = !}• Put 

A = {r' > 2k{N)}, 

where k{N) is such that k{N) — )• oo as — )• oo and k{N) < N. We will fix the choice of 
A;(A^) later. We have for any realization oi W £ A, 

E^[{Sl - Sl){Sl - S\)] < E^[{r - Sl){Sl - Ol{/<s^-fc(Af)}u{r>s^+fe(Af)}] 

+ E^[{Sl - Sl){Sl - 5'i)l|^>5.i_fc(^).,,<5.i^_;j(^)|]. (6.63) 
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Using that conditionally on 5*2, r — S\ and S\ — I are independent and geometrically 
distributed, we obtain a first upper bound 

E^[{r - SDiSl - /)l{K5^-fc(7V)}u{.>5^+fc(7V)}] < ^(KN) + 2)(1 - 5)'=^. (6.64) 
We now consider the second term and use that r* > 2k{N). We have 

E^[{Sl - Sl){Sl - 5'l)l|i>5»_fc(7v);r<5'^+fc(Ar)}] 

^ ^£'^[(£'3 - t){t - 5'{)l|;>f_fc(7v);r<t+A:{Ar)}l{55=t}] 
t 

= ~ '^l)l{'>i-fcW}l{S5=t}-^^[('^3 - *)l{r<m(Af)}l^t-fc(Af)] ' (6-65) 

t 

where 

Gl^N)=^{Xs{i):t-k{N)<s<t}. 

Now, since S3 < r, 

-^^[('S's - *)l{r<t+fc(Af)}l^t-fc(Af)] = 

k{N) k{N) 

^ n X P'^IS^ _ t = n; r < t + kiN)\gl,^^^) < J] n x P^(S| - t = 

n=l «=1 

Notice that 

Now we use Proposition [H Since we are working on {r* > 2k{N)}, we have 

p\o\ixl:l^^^) = p\o\i), . . . ,p^(i|o"-Hx*:,\^)) = pXi|o"-H), 

for all n < k{N). Therefore, 

E^[{Sl - t)l{r<t+k(N)}\Ol_k(N)] 
k{N) 

< ^ n X p*(0|iy (0|01) X ... X p^(0|0"-2iy (i|o"-il) 



n=l 

00 

< X pX0|iy(0|01) X ... X p^(0|0"-2iy(i|o"-il) 

n=l 

= E^{Si-S\). (6.66) 
We conclude that on A, using successively (|6.63p - (|6.66p . 

E^[{S{ - Sl){Si - S\)] < ij{k{N) + 2)(1 - ,5)^(^) + E^{Si - Si)E^ {S^ - S{). 
In a second step, we are seeking for lower bounds. We start with 

E^[{s\ - si){si - s\)] > e'^ksi - sDisi - sl)l^l^g^^_^J^y^J.^g^^_^_^^-^y]. (6.67) 
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Then on {S* = t}, 

k(N) 

E^^iiSl - t)^r<t+kiN)}\GU(^N)] = E " ^ ^"^("^s -t = n;r<t + k{N)\gl,^^^] 

n=l 



> n X - t = n|g^,(^)) - kiNfp'^ir > t + k{N)\gl 



k{N)/ 



n=l 



/k(N) \ 



Now, on {Si = t], 



k{N) 

Y,nxP^{Sl-t = n\gl,^^^) 

n=l 

= E'^iSl - S',; Sl-S'2< HN)) = E'^iSl - S\) - E^^ {S\ - S\- SI- Sl> k{N)) 
> E'^iSl - Si) -E'^ir- Sl;r - S', > k{N)) 

> E^{Si - Si) - ^{k{N) + 2)(1 - 

Therefore, for any reahzation W G A, 

E'^iiSl - Sl){Sl - SI)] > E'^iSi - SDE'^iSl - S\) - [^{k{N) + 2) + k{N)\l - 5)'=^. 
Putting things together and supposing that k{N) + 2 < k{N)'^ , we obtain finally 
\E^[{Si - Si){Si - S\)] - E^{Si - Si)E^{S\ - Si)\ < ^k{N)^{l - 6)'''-^l 

It remains to find an upper bound for ¥'{A'^). Clearly, applying Proposition [3l since k{N) < 
N, we have 

It is enough to choose k{N) = \/iV to conclude the proof. • 
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